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Abstract. D. Krieger and J. Shallit have proved that every real number greater 
than 1 is a critical exponent of some sequence fT). We show how this result can be 
derived from some general statements about sequences whose subsequences have 
(almost) maximal Kolmogorov complexity. In this way one can also construct a 
sequence that has no "approximate" fractional powers with exponent that exceeds 
a given value. 

1 Kolmogorov complexity of subsequences 

Let ft) = coqCOi . . . be an infinite binary sequence. For any finite set A C N let ft) (A) be a 
binary string of length #A formed by ft), with i G A (in the same order as in ft)). We want 
to construct a sequence ft) such that strings ft) (A) have high Kolmogorov complexity for 
all simple A. (See [3] for the definition and properties of Kolmogorov complexity. We 
use prefix complexity and denote it by K, but plain complexity can also be used with 
minimal changes.) 

Theorem 1. Let y be a positive real number less than 1. Then there exists a sequence 
CO and an integer N such that for any finite set A of cardinality at least N the inequality 

K(A,co(A)\t) ^ j-#A 

holds for some t £ A. 

Here K(A, ft)(A) |f ) is conditional Kolmogorov complexity of a pair (A, ft)(A)) rela- 
tive to t . 

Proof. This result is a consequence of Lovasz local lemma (see, e.g., [4] for a proof): 
Lemma. Assume that a finite sequence of events Ai , . . . ,A„ is given, for each i some 

subset N(i) C {1, . . . ,«} of "neighbors" is fixed, positive reals £\ , . . . , £„ are chosen in 

such a way that 

and for every i the event A, is independent of the family of all Aj with j £ N(i),j ^ i. 
Then the probability of the event "not A\ and not A2 and. . . and not A„" is at least 
(l-ei)-...-(l-e B ). 

The standard compactness argument shows that it is enough (for some N; the choice 
of N will be explained later) to construct an arbitrarily long finite sequence co that sat- 
isfies the statement of TheoremQ] Let us fix the desired length of this (long) sequence. 
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For any set A (whose elements do not exceed this length) and any string Z of length |A | 
such that K{A,Z\t) < y-#A for every t £ A consider the event CO {A) = Z; the set A is 
callled the support of this event. We have to prove that the complements of these events 
have non-empty intersection. 

This is done by using Lovasz lemma. Let us choose some j3 between 7 and 1 . Let 
e, be 2~P S where s is the size of support of jth event. For each event 03(A) = Z the 
neighbor events are events Co(A') = Z' such that the supports A and A' have nonempty 
intersection. Let us check the assumptions of Lovasz lemma. 

First, an event A, is independent of any family of events whose supports do not 
intersect the support of A,-. 

Second, let ©(A) = Z be an event and let n be the cardinality of A. The probability 
of this event is 2~ n . We have to check that 2~" does not exceed lr$ n multiplied by the 
product of ( 1 — 2~^™) factors for all neighbor events (where m is the size of the support 
of the corresponding events). 

This product can be split into parts according to possible intersection points. (If 
there are several intersection points, let us select and fix one of them.) Then for any 
t €A and for any m there is at most 2^" factors that belong to the f-part and have size 
m, since there exist at most 2? m objects that have complexity less than ym (relative to t ). 
Then we take a product over all m and multiply the results for all t (there are n of them). 
The condition of Lovasz lemma (that we need to check) gets the form 

2-"^2-P n Y[(l-2- pm ) 27mn 

m>N 

or (after we remove the common exponent n) 

m>N 

Bernoulli inequality guarantees that this is true if 

2^- x < 1 - £ 2 ~ f,n 2-P m 

m>N 

Since the left hand side is less than 1 and the geometric series converges, this inequality 
is true for a suitable N. (Let us repeat how the proof goes: we start with j3 6 (7, 1), then 
we choose using the convergence of the series, then for any finite number of events 
we apply Lovasz lemma, and then we use compactness.) 
(End of proof) 

The inequality established in this theorem has an useful corollary: 

K{co{A)\t) > y-#A-K(A\t)-0(l), 

since ^T(A,a)(A) |?) < K(A\t) +K(co(A)\t) +0(1). For example, if A is an interval, then 
K(A\t) is o(#A), so this term (as well as an additive constant 0(1)) can be absorbed by 
a small change in 7 and we obtain the following corollary ("Levin's lemma", see [2 1 for 
a discussion and further references): for any 7 < 1 there exists a sequence CO such that 
all its substrings of sufficiently large length n have complexity at least yn. 
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2 Critical exponents 

Let X be a string over some alphabet, and let Y be its prefix. Then the string Z = X . . . XY 
is called ^fractional power ofX and the ratio |Z| / \X | is its exponent. A critical exponent 
of an infinite sequnce CO is the least upper bound of all exponents of fractional powers 
that are substrings of CO. D. Krieger and J. Shallit [1] have proved the following result: 

Theorem 2. For any real a > 1 there exists an infinite sequence that has critical expo- 
nent a. 

Informally speaking, when constructing such a sequence, we need to achieve two 
goals. First, we have to guarantee (for rational numbers r less than a but arbitrarily 
close to a) that our sequence contains r-powers; second, we have to guarantee that it 
does not contain ^-powers for q > a. Each goal is easy to achieve when considered 
separately. For the first one, we can just insert some r-power for every rational r < a. 
For the second goal we can use the sequence with complex substrings: since every q- 
power has complexity about 1 jq of its length (the number of free bits in it), Levin's 
sequence does not contain long g-powers if q > 1/y. 

The real problem is to combine these two goals: after we fix the repetition pattern 
needed to ensure the first requirement (i.e., after decide which bits in a sequence should 
coincide) we need to choose the values of the "free" bits in such a way that no other 
(significant) repetitions arise. For that, let us first prove some general statement about 
Kolmogorov complexity of subsequences in the case when some bits are repeated. 

3 Complexity for sequences with repetitions 

Let ~ be an equivalence relation on N. We assume that all equivalence classes are finite 
and the relation itself is computable; moreover, we assume that for a given x one can 
effectively list the x's equivalence class. This relation is used as a repetition pattern: we 
consider only sequences CO that follows ~, i.e., only sequences CO such that ft), = C0j if 
i ~ j. For any set A c N we consider the number of free bits in A, i.e., the number of 
equivalence classes that have a non-empty intersection with A; it is denoted #yA in the 
sequel. 

There are countably many equivalence classes. Let us assign natural numbers to 
them (say, in the increasing order of minimal elements) and let c(z) be the number of 
equivalence class that contains i. Then every sequence CO that follows the repetition 
pattern ~ has the form to, = T c /f> for some function c : N — > N. 

Now we assume that the equivalence relation ~ (as explained above) and a constant 
y < 1 are fixed. 

Theorem 3. There exists a sequence co that follows the pattern ~ and an integer N 
with the following properties: for every finite set A with #fA^ N there exists t 6 A such 
that 

K(co(A)\t) ^ Y-#fA-K{A\t)-logm(t) 
where m(t) is the "multiplicity" oft, i.e., the number of bits in its equivalence class. 
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(Note that if all equivalence classes are singletons, then logm(f) disappears, #fA is 
the cardinality of A and we get an already mentioned corollary.) 

Proof. Let 0), = T c ^ where T is a sequence that satisfies the statement of TheoremQ] 
(with the same 7). For any A let B be the set of all c(i) for i E A. Then #B = #/A. 
Theorem Q] guarantees that K(B,t(B)\u) ^ y-#B for some u £ B. Since u 6 B, there 
exists some t £ A such that c(t ) = m. To specify f when u is known, we need logm(f ) bits, 
so K(t\u) ^ log m(t) + 0(1). After t is known, we need K(A\t) additional bits to specify 
A and K((o(A)\t) bits to specify to (A). Knowing A and to (A), we then reconstruct B and 
x(B). Therefore, 

y-#B < K{B,t{B)\u) < logm(f) + #(A|?) + A"(o)(A)|f) + 0(1), 

which implies the desired inequality (with additional term (9(1), which can be compen- 
sated by a small change in 7). 

4 Construction 

Assume that 1 < a < j5. First, let us show that Theorem [3] implies the existence of a 
binary sequence to that contains fractional powers of all rational exponents less than a, 
but does not contain long fractional powers of exponents greater than j3 . 

To construct such a sequence, let ri,^,... be all rational numbers between 1 and 
a. For each r, = pi/qt we "implant" a fractional power of exponent r, in the sequence: 
we select some interval of length pt and decide that this interval should be a fractional 
power of some string of length qt (and exponent r,). This means that we declare two 
indices in this interval equivalent if they differ by a multiple of q,. (The intervals for 
different i are disjoint.) We call these intervals active intervals. We assume that distance 
between two active intervals is much bigger than the lengths of these two intervals (see 
below why this is useful). 

X\ ^ Y\ X2 ^ Y2 

^ n\x^\ ^ ^ r^\x~ 2 \ ^ 

Fig. 1. Two fractional powers of exponent r\ and r2 are implanted; F, is a prefix of Xj 
(in this example the exponents are less than 2, so only one full period is shown). 

Evidently, any sequence that follows this repetition pattern has critical exponent at 
least a. 

Let us choose some 7 between a/j3 and 1 and apply Theorem [3] with this 7 to the 
pattern explained above. We get a bit sequence; let us prove that it does not contain long 
fractional powers of exponent greater than j3 . Indeed, it is easy to see that density of 
free bits in this pattern is at least 1 /alpha, i.e., for any interval A of length I the number 
of free bits in it, «/A, is at least / / a. Indeed, if A intersects with two or more active 
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intervals, then all bits between them are free, and the distance between the intervals is 
large compared to interval sizes. Then we may assume that A intersects with only one 
active interval. All subintervals of the active interval have the same repetitions period, 
and the density of free bits is minimal when A is maximal, i.e., coincides with the entire 
active interval. The bits outside the active interval are free (no equivalences), so they 
can only increase the fraction of free bits. 

On the other hand, a fractional power of exponent j3 and length / has complexity 
//j3 +0(\ogl) (we specify the length of the string and bits that form the period). 
For long enough strings we then get a contradiction with the statement of Theorem [3] 
since a/j3 < y. 

To get rid of short fractional powers of exponent greater than j3 we can add ad- 
ditional layer of symbols that prevents them. In other terms, consider a sequence in a 
finite alphabet that follows (almost) the same repetition pattern but has no other repe- 
titions (not prescribed by the pattern) on short distances. It is easy to construct such a 
sequence; for example, we may assume that qi is a multiple of 2! and then consider a 
periodic sequence with any large period M; it will destroy all periods that are not mul- 
tiple of M, i.e., all short periods and only finitely many of qi (the latter does not change 
the critical exponent). The Cartesian product of these two sequences (/th letter is a pair 
formed by /th letters of both sequences) has critical exponent between a and j3 . 

In fact, we even get a stronger result: 

Theorem 4. For any a and j3 such that 1 < a < j3 there exist a sequence co that has 
fractional powers of exponent r for all r < a but does not have approximate fractional 
powers of exponent j3 or more: there exists some £ > such that any substring of length 
n is En-far from any fractional power in terms of Hamming distance (we need to change 
at least £n symbols of the sequence to get a fractional power of length n). 

Indeed, a change of e-fraction bits in a sequence of length n increases its complexity 
at most by H(e)n + (9(logn) where 

//(e) = -eloge- (1 -e)log(l -e). 

Therefore, we need to change a constant fraction of bits to compensate for the difference 
in complexities (between the lower bound guaranteed by Theorem [3] and the upper 
bound due to approximate periodicity). (End of proof.) 

5 Critical exponent: exact bound 

The same construction (with some refinement) can be used to get a sequence with given 
critical exponent. 

Theorem 5. (Krieger - Shallit) For any real number a > 1 there exists a sequence that 
has critical exponent a. 

(This proof follows the suggestions of D. Krieger who informed the author about the 
problem and suggested to apply TheoremQ]to it- See [1] for the original proof. Author 
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thanks D. Krieger for the explanations and both authors of [1 1 for the permission to cite 
their paper.) 

Again, let us consider repetition pattern that guarantees all exponents less than a 
and apply Theorem [3] with some y close to 1. This (as we have seen) prevents powers 
with exponents greater that a/y; the problem is how to get rid of intermediate expo- 
nents. 

To do this, we should distinguish between two possibilities: (a) an unwanted power 
is an extension of the prescribed one (has the same period that unexpectedly has more 
repetitions) and (b) an unwanted power is not an extension. The first type of unwanted 
powers can be prevented by adding brackets around each active interval (in a special 
layer: we take a Cartesian product of the sequence and this layer). 

It remains to explain why unwanted repetitions of the second type do not exist (for y 
close enough to 1). Consider any fractional power with exponent greater than a. There 
are two possibilities: 

(1) It intersect at least two active intervals. Then it contains all free bits between 
these intervals, and (since we assume that the distances are large compared to the length 
of intervals) the density of free bits is close to 1, so exponent greater than a is impossi- 
ble. 

(2) It intersects only one active interval. The same argument (about density of free 
bits) shows that if the endpoints of this fractional power deviate significantly from the 
endpoints of the active interval, then the density of free bits is significantly greater 
than 1 1 a and we again get a contradiction. Therefore, taking y close to 1 we may 
guarantee that the distance between endpoints of fractional power and active interval is 
a small fraction of the length of the active interval. Then we get two different periods 
in the intersection of fractional power and active interval. One ("old") is inherited from 
the repetition pattern; the second one ("new") is due to the fact that we consider a 
fractional power. (The periods are different, otherwise we are in the case (1).) The 
period lengths are close to each other. Indeed, if the new period is significantly longer, 
then the exponent is less than a; if the new period is significantly shorter, then the 
complexity bound decreases and we again get a contradiction. 

Now note that two periods t\ and ?2 in a string guarantee the period t\ — ti near the 
endpoints of this string (at the distance equal to the difference between string length 
and minimal of these periods). Therefore we get a period that is a small fraction of the 
string length at an interval whose length is a non-negligible fraction of the string length. 
This again significantly decreases the complexity of the string, and this contradicts the 
lower bound of the complexity. (End of the proof.) 

Remark. This proof uses some parameters that have to be chosen properly. For a 
given a we choose y that is close enough to 1 and makes the arguments about "suffi- 
ciently small" and "significantly different" things in the last paragraph valid for long 
strings. Then we choose the repetition patterns where length of active intervals are mul- 
tiples of factorials and the distances between them grow much faster than the lengths of 
active intervals. Then we apply Theorem[3]for this pattern. Finally, we look at the length 
N provided by this theorem and prevent all shorter periods by an additional layer. An- 
other layes is used for brackets. These layers destroy only finitely many of prescribed 
patterns and unwanted short periods. 
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